nlp_architect.data.cdc_resources.data_types.wiki package

Submodules

nlp_architect.data.cdc_resources.data_types.wiki.wikipedia_page module

class nlp_architect.data.cdc_resources.data_types.wiki.wikipedia_page.WikipediaPage(orig_phrase: str = None, orig_phrase_norm: str = None, wiki_title: str = None, wiki_title_norm: str = None, score: int = 0, pageid: int = 0, description: str = None, relations: nlp_architect.data.cdc_resources.data_types.wiki.wikipedia_page_extracted_relations.WikipediaPageExtractedRelations = None)[source]

Bases: object

toJson() → Dict[KT, VT][source]

nlp_architect.data.cdc_resources.data_types.wiki.wikipedia_page_extracted_relations module

class nlp_architect.data.cdc_resources.data_types.wiki.wikipedia_page_extracted_relations.WikipediaPageExtractedRelations(is_part_name: bool = False, is_disambiguation: bool = False, parenthesis: Set[str] = None, disambiguation_links: Set[str] = None, categories: Set[str] = None, aliases: Set[str] = None, be_comp: Set[str] = None, disambiguation_links_norm: Set[str] = None, categories_norm: Set[str] = None, aliases_norm: Set[str] = None, title_parenthesis_norm: Set[str] = None, be_comp_norm: Set[str] = None)[source]

Bases: object

static extract_categories(line: str) → Set[str][source]
extract_relations_from_text_v0(text)[source]
static find_in_line(text: str, pattern: str) → bool[source]
static is_name_part(line: str) → bool[source]
toJson() → Dict[KT, VT][source]

nlp_architect.data.cdc_resources.data_types.wiki.wikipedia_pages module

class nlp_architect.data.cdc_resources.data_types.wiki.wikipedia_pages.WikipediaPages[source]

Bases: object

add_page(page)[source]
get_and_set_all_aliases()[source]
get_and_set_all_categories()[source]
get_and_set_all_disambiguation()[source]
get_and_set_be_comp()[source]
get_and_set_parenthesis()[source]
get_and_set_titles()[source]
get_pages()[source]
toJson()[source]

Module contents